Corpus: nep_news_2009_100K

Other corpora

4.4.1.5 Number of Word-N-grams at Sentence Endings

Number of word-N-grams for N=1...5 for the first K sentences

K # of words # of bigrams # of trigrams # of 4-grams # of 5-grams
100 16 54 92 98 99
1000 217 607 881 980 993
10000 1632 4990 8117 9615 9916
100000 8512 33993 66999 89380 97745
1000000 8512 33994 67000 89381 97746


Zipf's diagram for sentence endings


Gnuplot diagram

5423 msec needed at 2018-03-17 00:10